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Many classes of line drawings such as contour maps appearing in weather 
data representation (e.g., isobars and isotherms), seismograph data etc., exhibit 
structure where 2, 3, or even more adjacent curves are very similar to one another. 
These pictures thus exhibit strong two dimensional dependences — along curve 
dependencies which show up as smooth curves with slowly changing curvatures, 
and the afforementioned curve-to-curve dependencies. In this work we discuss 
an approach to exploiting these dependencies in order to achieve high data 
compression. We take as a starting point one of the approaches to date of 
along curve compression through use of cubic spline approximation euid extend 
this by investigating the additional compressibility achievable through curve- 
to-curve structure exp/oitation . We report on one of the models under investiga- 
tion, namely that in which a curve is fairly well approximated as a linear 
transformation of an adjacent cxirve. This transformation includes any possible 
combination of a curve translation, two dimensional stretching, and rotation 
about an arbitrary axis. Hence, this coding might in concept be thought of as 
a linear least mean square eri^r predictive coding in which rather than predicting 
a new data point based on past data in a time series, the system is predicting a 
new curve based on preceding curves in a region. In practice formidable diffAi^- 


exist because of lack of an obvious indexing parameter and lack of any 
'isable statistical stationarity. Much weather map data consists of nested 
curves (representing hills), and for typical examples of this data we show that 
compression ratios of 1.5 to 2.0 above that achievable with spline approxima- 
tions to individual curves are possible. This compression is achieved through 
use of curve-to-curve dependencies in encoding knot points and no additional 
error is introduced. If some additional error is permitted, greater compressi^'n 
is possible. 


I 


INTRODUCTION 


Image data compression has become 6m extensive research area since computers 
have been more and more available for automatic storage, processing and trans* 
mission of pictorial data. This includes TV pictures used in space program, 
biomedical data, weather maps and cloud pictures in meteorology etc. Conven- 
tional method of scanning an image for transmission (for storage) generates a 
data sequence which usually contains considerable redundancy. So here the data 
compression problem is that of representing two dimensional data in a most 
efficient way without loosing any necessary information. 

In most of the work done on the subject, data sequences generated by line 
scanners are compressed by methods like run-length coding, transform coding, 
predictive coding, etc. [1-3]. For all of the above methods, statistical 
dependencies between successive sjrmbols in line scan is exploited. For many 
applications, it is possible to use the theory very successfully because data 
can be well defined as a stationary stochastic process. 

It is possible to get much higher compression ratios if two dimensional 

structures inherent in pictorial data are recognized. In that case, data 

sequence tc be encoded can be a set of parameters representing lines, curve 

potrfis In 

segments, etc. [4], or it can Le the sequence of coordinates of contour lines 

A 

in a picture [5J. Recently, Pavlidis [6] represented certain kinds of pictures 
and maps by piecewise linear surfaces. 

In this paper a two dimensional predictive coding me'rhod is applied to 
a class of line drawings which consist of groups of similar looking smooth 
curves such as appear in Weathermaps, seismographs etc. (Section II). The 
essential information for representing each curve is a number of knot points 
and the remainder of each curve is then reconstructed through use of cubic 
splines. In sections III and IV, we discuss a compression method applied to 


a group of closed curves (Figures 3, 4). Instead of encoding each knot point 
on each curve by picture coordinates. It's location is predicted by exploiting 
the similarities between curves* It should be mentioned that, evea though real 
data does not everywhere consist of closed curves as discussed in sections III 
and IV, the method can easily be extended to open curves. In case of single 
curves which do not look like any other curve in the picture, only the informa- 
tion contained on the same curve can be used for prediction. 

Predictive coding method used here is not a standard theoretical procedure 
(such as prediction using an autocorrelation function) since for data such as 
we deal with, standard stationary data sequence methods do not appear to be 
applicable because the nximber of degrees of freedom of these patterns is too 
small for the concept of stationarity to be meaningfully applied. However, 
linear transformations (section III) can be thought of as playing a structural 
description role here. Furthermore, since the linear transformation which is 
appropriate differs from one region of a picture to another, we have a 
situation analogous to that of a quasistationary time sequence where the auto- 
correlation function is different for different intervals and must be described 
to the receiver if it is to be used for purposes of data compression. 

Since a picture contains regions exhibiting a number of different tvoes of 

data structures — the nested contours treated in this paper being one type 

pattern recognition plays the important role of segmenting the picture field into 
regions each of which can then be encoded in the most appropriate way. 


II. WEATHERMAPS 


2.1 Previous Work on redundancy removal for weathermaps 

Figures 1 and 2 are typical examples of the meteorological maps that are 
constantly transmitted throughout weather stations for forecast purposes. The 
usual data sequence to be transmitted for these pictures is a sequence of binary 
symbols representing a line by line scan of a two gray level picture. There has 
been some work on efficient coding of these strings such as [!]» where constant 
output bit rate was used, and [2] where run-length coding schemes for source 
coding were used. The best reduction factor is about in these schemes. 

More recently, conic curves have been used to describe and transmit weather 
maps. The contours of constant pressure and temperature were approximated by 
conic curves such as a segment of a parabola or ellipse [3]. The parameters 
corresponding to these curve segments are encoded. Such a system is claimed 
to be capable of compressing the data substantially, often by two orders of magnitude. 

2.2 Picture Structure 

The structure that is observable in Figures 1 and 2 arises in some other 
applications as well, such as in geographic elevation mapping, cartography and 
seismology. In all of these drawings, there is more structural redundance than 
a contour being representad>le by a number of conic curves: a) By being smooth 

and slowly changing, a piece of a single curve carries information about the 
other parts of the same curve. That fact has been used before [4 3 for redundancy 
removal purposes, b) There are groups of nested curves that show very similar 
structures. (Some examples are circled in Figures 1 and 2). Once the properties 
of one of these curves is known, behavior of others in the same group can be 
predicted with small error. The goal in this paper is to utilize the property(b) 
above together with (a) to achieve gi eater compression by exploiting the regional 
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(group) dependencies as well as single curve dependencies. 

III. PROCESSING OF DATA 

3.1 Use of computer graphics 

An acoustic data tablet attached to the graphics facility was used to input 
the curves fTOm weathermaps to appropriate memory . The graphics facility was also 
used to manipulate the curves interactively, as will be explained in 3.2. Stored 
data was further processed by APL because vector and matrix operations can be 
programmed very quickly, resulting in easy experimentation emd debugging. 

3.2 Linear Transformations 

The Graphics facility was used to explore the relations between the curves 
within a group. (A group is roughly defined in Section 2.2.) The similarity of 
these curves suggests that a simple combination of linear transformations such as 
relocation, scaling and rotation can transform one curve in the group to another. 

In other words, with the use of a few parameters two curves can be made to look 
quite similar. Figures 3-6 show two examples of groups of closed curves and 
their re-shaped versions. (Uniformly scaled, rotated and shifted.) The purpose 
of the re-shaping operation is two-fold: 

1. The similarities and differences between these curves can be observed 
more easily. 

2. Re-shaping can be a part of the coding process. That is, if only a 

small number of transformation parameters is enough to transfonn one curve to 
another, it can be worthwhile to do the encoding in the following way: First, encode 

the suitable transformation parameters and one of the curves. Then encode only 

the differences between one curve and transformation of another. This is in fact 
what is done in this paper and we will return to this in Section 4.1. 


To b« able to realize the second purpose above* it is necessary to define a 
measure of distance between contours and to set up an algorithm for determining 
suitable parameters for transformations. 


3.3 Transformation Algorithms 

Define the distance between two curves, referred to as "1" and ”2" (Figure 7) 
as follows: Choose an appropriate number, say n, of points (spaced according to 

the variations in the curve; this matter will be discussed at the end of the section) 
on curve 1. For each selected point, find the closest point on Curve 2. The sum 
of the squares of distances between corresponding points Is an appropriate distance 
measure. Then, the problem of Interest In this section Is to find a linear trans- 
formation (a set of parameters) for curve 2 such that the distance between the 

transformed curve (call It 2^) and curve 1 Is minimum. 

11 2 2 
Let (x^, y^^) denote the i’th point on curve 1 and (x^,yj) denote the j'th 

point on curve 2. The transformation of Interest is; 
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Generalize equation 1 to n points 
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where MV is the parameter vector [m^^ m^^ mg] 

The least square solution to MV is given by 

/\ T -1 T 1 

MV s (A A) A (3) 

. „1 r 1 1 1 1 1 InT 

where P = [x^ ^1 ^2 ^2 *’* ’‘n ^n^ 

As it can be observed, the parameter vector MV provides for operations of 

scaling, rotation and translation. The four parameters m^, mj, m^, and m^ provide 

for a combination of scaling and rotation operations, m^ and mg are translation 

parameters for the x and y coordinates respectively following scaling and rotation. 

it 

2 

The choice of points selected for distance measure will effect MV and p 
If points are chosen closer together at the regions where curves are rapidly 
changing (and possibly not very similar) and if they are chosen far apart at the 
slow changing regions where curves look quite alike, it is possible to get a better 
fit. The same effect can be achieved by weighting the errors more heavily in some 
regions than in others, causing parameters to define a transformation in favor of 
heavily weighted regions. 


3.4 Spline Representation 

In this work. Instead of representing curves by conic sections, parametric 
cubic spline approximations are used because they are found to be convenient to 
work with since spline fitting algorithms have. nice properties such as automatically 
guaranteed slope continuity at the knot points, and smoothness of spline functions. 

For periodic splines (closed curves) the only Information to be coded Is the 
coordinates of the knot points. For non-perlodlc splines, additional 2 end-point 
slope Information Is required. (See appendix). If the same number of knot points 
are assigned to two similar curves , It Is possible to express one curve In terms 
of the other. This Is discussed In the next section. 
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IV. CODING PHILOSOPHY 


H.l Discussion of coding method 

Consider one typical group of closed curves. Figure 4. In this section we 
discuss an adaptive predictive coding method for the knot points on one of the 
curves^given all knot points on the adjacent one, and we roughly estimate the 
costs (code length in bits) involved in the coding process. The problem of pre- 
dicting knot points on a single curve in terms of other knot points on the same 
curve r.in be handled by known methods (such as curve fitting) and will not be 
discussi c here. It will be enough to say that a reduction factor of 1.4 is 
possible over coding each knot point individually for the weathermap data in the 
examples shown. 

Figure 8 shows the result of one global transformation applied tc 3 inner 

most curves of figure 4. The transformation minimizes the sura of the errors 

* ft 

in approximating each curve by a transformation of an adjacent curve. 2,3, 

and 4* are the transformed versions of 2, 3, and 4. The problem of interest 

ft ft 

is to encode the differences between 1 and 2,2 and 3 etc. in a most efficient 
way. The reasons for using one transformation above instead of using optimal 
transformations for 1 and 2 and for other pairs separately is a compromise between 
better overall fits and the cost of encoding transformation parameters. We will 
return to this later in the section wl.ere overall costs and reduction factors 
are discussed. 

Consider curves 1 and 2 . The set of points to be encoded on 1 are shown 

ft 

by '0”s. They are determined by spline algorithms. The knot points for 2 

are determined by a procedure that will be discussed later in the section. 

a 

The points to be encoded on 2 are shown by 'X'*s in the figure. (Curve 2 is 

ft 

recovered from 2 at the receiver through use of inverse transformation applied 


to 'X"t( then by the use of spline algorithms on these transformed points.) As 
expectedt although most parts of the two curves are very close, portions of 
them are significantly different due to local variations which cannot be 
avoided by a single linear transformation. So, variability in local behavior 
suggests a cooing method adaptive to local characteristics. That can be obtained 
by partitioning the curves into sets of regions and defining different predictors 
for different sets. 

In figure 8, the field is divided into two regions SI and S2. SI is the 

a 

region where curves 1 and 2 are quite close, and S2 is the region where the 
two curves are very dissimilar. In general, SI and S2 will be unions of 
neperated sets, as is the case in figure 8, where A is one such subset of SI. 


When referring to a predictor for any point i on curve 2 , we will in 

2 2 " 

theory mean the conditional expectation of point i « (x^ * ^i ^ whose conditional 
probability density function can be expressed by: 


2* 2 I 1 1 1 1 2 2 2* 2* V 

yi*’****n* ^n* **1 * ^1 ***** **i-l* ^i-l' 


where S^ * {Xj^^ y^,k * l,...n) is the set of all knots on curve 1 

and S^_j^ » {x^ , y^ ,k ■ 1,... i - 1) is the set of knots on 2 that 

are already known. 

2 * 

Then, the predictor m^ is defined as 

”i “ ^^**i • ^i I ^i-1^ 

11 2 * 

For SI above, we observe in the examples that the effect of (x. , y.) on >n. is 

2 * 

much greater than is the effect of the other points on m^ . It is reasonable to 


assume that 


2” 1 V 2^ 2* 

®i * ^**i* ^** » y^ J 


1 1 , 


likely to be equal to the corresponding (x^, y^) on 1. 


J * 


FUrth«mor«, for Si , (x^, ) Is taksn to bs ths Intsrssction of curvs 2 

with ths psrpsndlcular line to ths spline approximation to curvs 1 at point 

11 1 1 2* 2** 

(x^typ. Then only 1 error variable, the distance from (x^,yj^) to (x^ ty^^ ), 

need be encoded and transmitted. 

For S2 it is more difficult to decide which paz*ticular subsets of and 

2 * 2 * 

have the greatest interdependence with m^ . We will assume that 

2* 11 11 
that is, m^ is a function only of (x^ y^), the distance between 

2^ 2*^ ^ 
and (Xj^ y^^ ^), and the last k points that are already known on 2 . 

For S2, choosing knots on 2*'^ by using perpendiculars from 1 does not always 

result in suitable points because of fast changes in either one of the curves. 

Our approach is to choose knots on 2^ such that chordlengths between corresponding 

pairs of knots on each of the two curves are of constant ratio, which is equal to 

the ratio of the total lengths of curve segments over this particular region. 

((Pft i/Pa* 1 *) * ** figure 9.) P. . is the length of the segment 

between points i and j. For successive points, ^ can be approximated by the 

distance between these points.) In that case, the z^ceiver has to know the ratio 

r, prior to receiving information about knots on 2 for that region. 

2 * 

Then, the predictor m. will be s<^mewhere on the circle CR with racius r and 
2 * 2 * 

center located at (x, i* might be determined by one of the following 

2 * 2 * 2 * 2 * 

algorithms: (1) Fit a cubic curve C to the points yi-k^* 

Find the intersection of C with CR, (2) Find the intersection of circle CR with 

11 1 2 * 2 1 2 * 2 1/2 
circle with center at (x^, y^) and radius " **1-1^ * ^^i-1 " ^i-1^ ^ 

In any case, assume that the receiver wants to locate point P^ in figure 9, 

1 1* 2 2*. • .2 

while points P , P , and P are known. Assum that mf is fou..j to be the point P . 

citorJ* - ^ 

The transmitter has to encode only the difference ber^reen the lengths of”P^ P^oaoI p*'p^ 


Since |P^ P^l is known by the receiver, |P^ P^ | is calculated. Since IP"*” | 


2 „2 


A * 

1 2 


and |P^P^ I arc known also, the angle P'*’ P^P^ is established and hence P^ is 


1 « 2„2 


determined. 
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In figure 8, eight of the n ■ IS points fell in SI and seven fall in S2. 

We assume that each picture is quantized into 128 x 128 binary matrix form. 

A maximum of three bits per point in SI and S bits per point in S2 are required 
for encoding each d^. Another cost involved is the identification of each point 
belonging to one of the sets. That can be efficiently performed by fiz*st 
identifying a region belonging to one of the sets, which requires only one bit 
in our example, then encoding the number of knots belonging to the region. 
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requiring roughly four bits per region. Obviously, the above procedure should 

be perlormed before encoding the points for each region. The cost of encoding 

* 

curve 2 is '>«73 bits, including the cost of encoding r. 

e 

The number of bits z*equired to encode the differences for 2 and 3 is 

ft 

found by first defining the spline knots on 2 (which is reconstructed from 2 at 

the receiver) by an algorithm known by both receiver and transmitter. Then 

ft 

the same procedure that is used for 1 and 2 is used for encoding.Total cost of 
ft ft 

encoding 3 and 4 is 'V'lO? bits. 

There are 6 transformation parameters to be encoded, 2 of them representing 

translation. The translation parameters require 8 bits each (including sign). 

The other 4 are related to scaling and rotation, requiring approximately 8 and 

7 bits respectively for error-free reproduction at the receiver. Total cost 

for transformation parameters then is 46 bits. 

Would it be beneficial to apply a different transformation for each pair 

of curves and then to encode the differences between sets of transformation 

parameters? Fig\ire 8 shows the result of a transformation applied to curve 4 

ftft 

to make it look like curve 3 (4 ). Encoding the difference between transformation 

ft 

parameters is estimated to require about 16 bits, but newfit 4 gives rise to a 
saving of 8 bits. Thus, we conclude that it does not appear to be worthwhile to 
use different transformations for adjacent curves, but rather one optimal 
transformation for the whole group. 

A rough indication can be given of the saving in bits provided by this 
coding over encoding the knot points without using compression and with using single 
curve compression. Hence, we compare the cost of encoding for 4 curves in figure 8 
using our method with the other two mentioned above, a) The cost of encoding 4 
curves by encoding the knot points without using any compression is calculated by 
assuming that encoding each knot requires 14 bits (2 x2 matrix). Spline approximation 
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TX 
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requires a total of *^*44 points , resulting In a total cost of 'vSlB bits, b) Encoding 
the knot points \islng dependence along Individual curves require 10 bits for each 
of 44 points, requiring a sum of 440 bits, c) Total cost for our method Is the sum 

of: 1. Coding curve 1 by Itself, 2. Cost of transformation parameters, 3. Cost 

e * * 

of encoding curves 2,3, and 4 . Total cost for 'c' Is 'v*380 bits. 

We conclude that a reduction factor of '><1.6 Is obtained for figure 8 over 
encoding knot points Individually. It must be mentioned that In the example 
above, 3 Innermost curves are transformed to adjacent outermost ones. That 
requires the number of knot points to be largerthan the case where 3 outer- 
most curves would be transformed to adjacent Inner ones. (Total word- 
length would be shorter for the second case.) It would be possible to get 
compression ratio of 'v>1.8 for the second case. 

4.2 Results and Conclusions 

In this section, we will summarize the suggested coding method step by 
step and discuss the results. A code might be constructed as follows" 

(1) Divide the whole region so that groups of similar curves are formed. 

(2) For groups that contain only one curve, encode each curve separately, 
by estimating each knot point using knots that are already known on the same 
curve. Here, each knot will require about 10 bits. 

(3) For each of the other groups: 

(a) First encode one of the curves as in (2) above 

(b) Encode the linear transformation parameters, requiring about 
46 bits total for one transformation 

(c) Encode the other curves In the group with respect to the one 
encoded In (a) and the transformation In (b). On the average, each knot 
point can be specified by 'V'5 bits Including the specification of the region 
It belongs. 


14 - 


The code constructed above Is able to compress weathermap or similar data 
with a ratio between 1.5 and 2 compared to the coding of individual knot points. 

The discussed compression ratio can be Improved if Huffman coding is used 
for encoding the symbols. Also, the relations between groups in one picture can 
be used for compression, such as exploiting the similarities between transforma- 
tion parameters. 
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